
brainGraph_GLM_design
takes a data.table
of covariates and
returns a design matrix to be used in linear model analysis.
brainGraph_GLM_design(covars, coding = c("dummy", "effects",
"cell.means"), factorize = TRUE, binarize = NULL, int = NULL,
mean.center = FALSE, center.how = c("all", "within-groups"),
center.by = getOption("bg.group"))
A numeric matrix. Rownames are subject ID's and column names are the variable names. There will be additional attributes recording the
coding
, factorize
, and mean.center
function arguments.
There will also be attributes for binarize
and int
if they
are not NULL
, and center.how
and center.by
if
mean.center=TRUE
.
A data.table
of covariates
Character string indicating how factor variables will be coded.
Default: 'dummy'
Logical indicating whether to convert character
columns into factor. Default: TRUE
Character vector specifying the column name(s) of the
covariate(s) to be converted from type factor
to numeric
.
Default: NULL
Character vector specifying the column name(s) of the
covariate(s) to test for an interaction. Default: NULL
Logical indicating whether to mean center non-factor
variables. Default: FALSE
Character string indicating whether to use the grand mean
or groupwise means. Default: 'all'
Character string indicating which grouping variable to use
for calculating means (if applicable). Default: 'Group'
The default behavior is to convert all character columns (excluding the Study
ID column and any that you list in the binarize
argument) to factor
variables. To change this, set factorize=FALSE
. So, if your covariates
include multiple character columns, but you want to convert Scanner to
binary instead of a factor, you may still specify binarize='Scanner'
and get the expected result. binarize
will convert the given factor
variable(s) into numeric variable(s), which is performed before
centering (if applicable).
The argument mean.center
will mean-center (i.e., subtract the mean of
from each variable) any non-factor variables (including any dummy/indicator
covariates). This is done after “factorizing” and
“binarizing”. If center.how='all'
, then the “grand mean”
will be used; otherwise, the groupwise means will be used. The grouping
variable is determined by center.by
and is by default 'Group'
.
int
specifies which variables should interact with one another. This
argument accepts both numeric/continuous (e.g., Age) and factor
variables (e.g., Sex). All interaction combinations will be generated:
if you supply 3 variables, all two-way and the single three-way interaction
will be generated. This variable must have at least two elements; it
is otherwise ignored. It is generally recommended that centering be performed
when including interaction terms.
Christopher G. Watson, cgwatson@bu.edu
There are three different ways to code factors: dummy, effects,
or cell-means (chosen by the argument coding
). Effects
coding is sometimes referred to as deviation coding. Dummy
coding is the default when calling lm
. To understand the difference
between these, see Chapter 8 of the User Guide.
Other GLM functions: GLM fits
,
GLM
, mtpc
if (FALSE) {
# Recreate design matrix when "outcome == measure"
DT <- res.glm$DT.Xy[region == levels(region)[1L],
!c('region', res.glm$outcome),
with=FALSE]
X <- do.call(brainGraph_GLM_design, c(list(covars=DT),
attributes(res.glm$X)[-c(1L, 2L)]))
all.equal(X, res.glm$X)
}
Run the code above in your browser using DataLab